Fine Grain Parallelisation of Functional Programs for VLIW or Super-scalar Architectures

Author

Marc Pouzet

Abstract

This paper presents a compaction method for functional programs (e.g., ML programs) targeting super-scalar or VLIW architectures. It generalises the Percolation Scheduling system [1] and Perfect Pipelining [2]. The method is described by a set of program transformations that respect data dependences. Instead of managing a control-flow based representation of programs, the compaction is applied directly to functional expressions. This leads to a simple expression of compaction with direct renaming and an efficient implementation. It greedily performs local and global compaction. The software pipelining principle, initially applied to loops, is extended to general recursive functions. To our knowledge, it is the first method to compact a functional language.
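The idea of greedy compaction under data dependences can be illustrated with a small sketch. This is not the paper's transformation system. It is a plain greedy list-scheduling pass over a straight-line sequence of let-bindings, written in Python for brevity. The names `Binding`, `compact`, and `width` are hypothetical, and the model assumes that every binding costs one step and that names not bound in the sequence (such as function parameters) are available from the start. Because let-bound names are assigned once, only true data dependences constrain the schedule, which is one way to read the abstract's remark about direct renaming.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical model: a binding is (defined name, names it reads).
Binding = Tuple[str, Tuple[str, ...]]


def compact(bindings: Sequence[Binding], width: int) -> List[List[str]]:
    """Greedily pack let-bindings into parallel steps of at most `width` ops."""
    if width < 1:
        raise ValueError("width must be at least 1")
    step_of: Dict[str, int] = {}   # step in which each bound name is computed
    steps: List[List[str]] = []    # steps[i] = names computed in parallel at step i
    for name, reads in bindings:
        # Earliest legal step: one past the latest step that produces an operand.
        # Names not bound in this sequence are treated as available at step 0.
        earliest = max((step_of[r] + 1 for r in reads if r in step_of), default=0)
        s = earliest
        # Skip steps that already use all `width` functional units.
        while s < len(steps) and len(steps[s]) >= width:
            s += 1
        if s == len(steps):
            steps.append([])
        steps[s].append(name)
        step_of[name] = s
    return steps


# let a = f x in let b = g y in let c = a + b in let d = x * y in let e = c - d
program = [
    ("a", ("x",)),
    ("b", ("y",)),
    ("c", ("a", "b")),
    ("d", ("x", "y")),
    ("e", ("c", "d")),
]
schedule = compact(program, width=2)
```

With two functional units the five bindings fit in three steps, since `d` moves up next to `c`. With `width=1` the same pass degenerates to the original sequential order.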

Similar resources

URSA: A Unified ReSource Allocator for Registers and Functional Units in VLIW Architectures

The division of instruction scheduling and register allocation and assignment into separate phases can adversely affect the performance of these tasks and thus the quality of the code generated for load/store fine grained parallel architectures. Improved performance in one phase can deteriorate the performance of the other phase, possibly resulting in poorer overall performance. In this paper w...

Dynamically Scheduling VLIW Instructions

Very long instruction word (VLIW) machines potentially provide the most direct way to exploit instruction-level parallelism; however, they cannot be used to emulate current general-purpose instruction set architectures. In addition, programs scheduled for a particular implementation of a VLIW model cannot be guaranteed to be binary compatible with other implementations of the same machine model...

DIVA: dual-issue VLIW architecture with media instructions for image processing

To meet the demand for processing enormous amounts of multimedia data, we have designed a VLIW (Very Long Instruction Word) processor called DIVA (Dual-Issue VLIW Architecture) that exploits the ILP (instruction-level parallelism) in multimedia programs. The DIVA processor, which can execute two instructions in one cycle, supports 86 instructions including 30 media instructions, and has a sub-word execution stru...

Using the parallel complexity of programs to improve compaction

This paper focuses on the compaction of tail-recursive programs (arbitrary programs with conditionals, loops and function calls appearing in terminal position) for fine-grain parallel machines like VLIW or super-scalars. An iterative compaction method based on instruction move-up and fold/unfold transformations is studied. In order to speed up compaction and produce shorter schedules, we propo...

Stream Execution on Embedded Wide-Issue Clustered VLIW Architectures

Very long instruction word (VLIW) based processors have become widely adopted as a basic building block in modern System-on-Chip designs. Advances in clustered VLIW architectures have extended the scalability of the VLIW architecture paradigm to a large number of functional units and very-wide-issue widths. A central challenge with wide-issue clustered VLIW architecture is the availability of pr...

Publication date: 1994
